5
Origins of Life and Earth Prehistory
43
currently estimated as 10 Superscript 7107 (metazoa), 2.5 times 10 Superscript 52.5 × 105 (plantae), 2 times 10 Superscript 52 × 105 (protozoa), and
5 times 10 Superscript 45 × 104 (fungi).
Problem. Estimate the fraction of all possible DNA sequences that are represented
in extant species.
The Problem of Bacterial Identification
Darwin’s notion of species was “a term arbitrarily given for the sake of convenience
to a set of individuals closely resembling each other” (cf. the slightly more formal
notion of quasispecies in sequence space: a cluster of genomes). Since bacteria pre-
dominantly proliferate asexually and can acquire new genetic material rather readily
(“lateral” or “horizontal” gene transfer), the criterion of reproductive isolation that
is rather helpful for defining species in metazoans is of little use. The first systematic
attempt to classify bacteria dates from 1872, when Ferdinand Cohn proposed a system
based on their morphology. The shape of individual bacteria can be easily seen in a
(high-power) optical microscope, and colonies growing on agar plates (for example)
often have characteristic morphologies themselves. Such a scheme can be readily
extended to include features such as pathogenicity and characteristic biochemistry,
and even characteristic habitat. The range of useful attributes depends essentially
on what measuring tools are available. Thus, for example, a classification based on
the compressibility of the bacterium placed between two parallel plates might also
be a useful one. Gram’s stain, which distinguishes between different characteristic
polysaccharides coating the bacterium, is well known. This is a dichotomous classi-
fication, and a hierarchy of dichotomies should lead unerringly to the identification
of a species (provided it is already known). All this knowledge has been captured
in the well-known Bergey’s Manual. Bacteria whose attributes did not match those
already known would be granted the status of a new species.
The advent of molecular biology provided further vastification of the range of use-
ful attributes. In particular, the nucleic acid sequence of the so-called 16S ribosomal
RNA (rRNA), part of the smaller subunit of the ribosome, was used by Carl Woese as
a new way of classifying bacteria and, together with an assumption about the rate of
mutations, could be used to construct a comprehensive phylogeny of bacteria. Bac-
teria seem to vary greatly in their genotypic (and phenotypic) stability, however, and
any classification based on the assumption of relative stability has some limitations. 2
2 See Coenye and Vandamme (2004) and Hanage et al. (2006) for some discussion of the matter;
Trüper (1999) has written an interesting article on prokaryotic nomenclature.